Transmembrane serine protease overexpressed in ovarian carcinoma and uses thereof

ABSTRACT

The present invention provides a transmembrane serine protease TADG-12 protein, splice variants of the TADG-12 protein and DNA fragments encoding such proteins. Also provided are vectors and host cells capable of expressing the DNAs. The present invention further provides various methods of early detection and therapies of associated ovarian and other malignancies by utilizing the DNAs and/or proteins disclosed herein.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a divisional of U.S. Ser. No. 10/357,175, filed Feb. 3, 2003, which is a continuation-in-part application and claims benefit of priority under 35 U.S.C. §120 of U.S. Ser. No. 09/650,371, filed Aug. 28, 2000, now U.S. Pat. No. 6,942,978, which is a divisional application of U.S. Ser. No. 09/518,046, now U.S. Pat. No. 6,294,663, which claims the benefit of priority under 35 U.S.C §120 of U.S. Ser. No. 09/261,416, now U.S. Pat. No. 6,291,663.

BACKGROUND OF THE INVENTION

1. Field of the Invention

The present invention relates generally to the fields of cellular biology and diagnosis of neoplastic disease. More specifically, the present invention relates to a transmembrane serine protease termed Tumor Associated Differentially-Expressed Gene-12 (TADG-12), which is overexpressed in ovarian carcinoma.

2. Description of the Related Art

Tumor cells rely on the expression of a concert of proteases to be released from their primary sites and move to distant sites to inflict lethality. This metastatic nature is the result of an aberrant expression pattern of proteases by tumor cells and also by stromal cells surrounding the tumors (1-3). For most tumors to become metastatic, they must degrade their surrounding extracellular matrix components, degrade basement membranes to gain access to the bloodstream or lymph system, and repeat this process in reverse fashion to settle in a secondary host site (3-6). All of these processes rely upon what now appears to be a synchronized protease cascade. In addition, tumor cells use the power of proteases to activate growth and angiogenic factors that allow the tumor to grow progressively (1). Therefore, much research has been aimed at the identification of tumor-associated proteases and the inhibition of these enzymes for therapeutic means. More importantly, the secreted nature and/or high level expression of many of these proteases allows for their detection at aberrant levels in patient serum, e.g. the prostate-specific antigen (PSA), which allows for early diagnosis of prostate cancer (7).

Proteases have been associated directly with tumor growth, shedding of tumor cells and invasion of target organs. Individual classes of proteases are involved in, but not limited to (1) the digestion of stroma surrounding the initial tumor area, (2) the digestion of the cellular adhesion molecules to allow dissociation of tumor cells; and (3) the invasion of the basement membrane for metastatic growth and the activation of both tumor growth factors and angiogenic factors.

For many forms of cancer, diagnosis and treatment has improved dramatically in the last 10 years. However, the five year survival rate for ovarian cancer remains below 50% due in large part to the vague symptoms which allow for progression of the disease to an advanced stage prior to diagnosis (8). Although the exploitation of the CA125 antigen has been useful as a marker for monitoring recurrence of ovarian cancer, it has not proven to be an ideal marker for early diagnosis. Therefore, new markers that may be secreted or released from cells and which are highly expressed by ovarian tumors could provide a useful tool for the early diagnosis and for therapeutic intervention in patients with ovarian carcinoma.

The prior art is deficient in the lack of the complete identification of the proteases overexpressed in carcinoma, therefore, deficient in the lack of a tumor marker useful as an indicator of early disease, particularly for ovarian cancers. Specifically, TADG-12, a transmembrane serine protease, has not been previously identified in either nucleic acid or protein form. The present invention fulfills this long-standing need and desire in the art.

SUMMARY OF THE INVENTION

The present invention discloses TADG-12, a new member of the Tumor Associated Differentially-Expressed Gene (TADG) family, and variant splicing forms of TADG-12 (TADG-12V and TADG-12D) that could lead to a truncated protein product. TADG-12 is a transmembrane serine protease overexpressed in ovarian carcinoma. The entire cDNA of TADG-12 has been identified (SEQ ID NO: 1). This sequence encodes a putative protein of 454 amino acids (SEQ ID NO: 2) which includes a potential transmembrane domain, an LDL receptor like domain, a scavenger receptor cysteine rich domain, and a serine protease domain. These features imply that TADG-12 is expressed at the cell surface, and it may be used as a molecular target for therapy or a diagnostic marker.

The present invention encompasses nucleic acids encoding the TADG-12 protein and its splicing variants, vectors and host cells capable of expressing the claimed nucleic acids, as well as isolated and purified TADG-12 and its variant proteins. Specifically, the TADG-12 protein and its variants have an amino acid sequence shown in SEQ ID NOS:. 2, 4 or 154.

The present invention further provides methods of diagnosing a cancer or malignant hyperplasia in a biological sample by detecting the presence of TADG-12 protein or mRNA disclosed herein.

In another embodiment of the present invention, there is provided a method of targeted therapy by administering a compound having a targeting moiety specific for a TADG-12 protein and a therapeutic moiety. Specifically, the TADG-12 protein has an amino acid sequence shown in SEQ ID NOS: 2, 4 or 154.

Other and further aspects, features, and advantages of the present invention will be apparent from the following description of the presently preferred embodiments of the invention given for the purpose of disclosure.

BRIEF DESCRIPTION OF THE DRAWINGS

So that the matter in which the above-recited features, advantages and objects of the invention, as well as others which will become clear, are attained and can be understood in detail, more particular descriptions of the invention briefly summarized above may be had by reference to certain embodiments thereof which are illustrated in the appended drawings. These drawings form a part of the specification. It is to be noted, however, that the appended drawings illustrate preferred embodiments of the invention and therefore are not to be considered limiting in their scope.

FIG. 1A shows that the expected PCR product of approximately 180 bp and the unexpected PCR product of approximately 300 bp using the redundant serine protease primers were not amplified from normal ovary cDNA (Lane 1) but were found in abundance from ovarian tumor cDNA (Lane 2). The primer sequences for the PCR reactions are indicated by horizontal arrows. FIG. 1B shows that TADG-12 was subcloned from the 180 bp band while the larger 300 base pair band was designated TADG-12V. The sequences were found to overlap for 180 base pairs (SEQ ID NO: 5 for nucleotide sequence, SEQ ID NO: 6 for deduced amino acid sequence) with the 300 base pair TADG-12V (SEQ ID NO: 7 for nucleotide sequence, SEQ ID NO: 8 for deduced amino acid sequence) having an additional insert of 133 bases. This insertion (vertical arrow) leads to a frame shift, which causes the TADG-12V transcript to potentially produce a truncated form of TADG-12 with a variant amino acid sequence.

FIG. 2 shows that Northern blot analysis for TADG-12 revealed three transcripts of 2.4, 1.6 and 0.7 kilobases. These transcripts were found at significant levels in ovarian tumors and cancer cell lines, but the transcripts were found only at low levels in normal ovary.

FIG. 3 shows an RNA dot blot (CLONTECH) probed for TADG-12. The transcript was detectable at background levels in all 50 of the human tissues represented with the greatest abundance of transcript in the heart. Putamen, amygdala, kidney, liver, small intestine, skeletal muscle, and adrenal gland were also found to have intermediate levels of TADG-12 transcript.

FIGS. 4A-4B show the entire cDNA sequence for TADG-12 (SEQ ID NO: 1) with its predicted open reading frame of 454 amino acids (SEQ ID NO: 2). Within the nucleotide sequence, the Kozak's consensus sequence for the initiation of translation and the poly-adenylation signal are in bold. In the protein sequence, a potential transmembrane domain is underlined in bold. The low-density lipoprotein receptor-like class A domain is underlined with a solid line. The scavenger receptor cysteine rich domain (SRCR) domain is underlined with a broken line. The residues of the catalytic triad of the serine protease domain are italicized bold, and the beginning of the catalytic domain is marked with an arrow designated as a potential proteolytic cleavage site. The asterisk represents the stop codon that terminates translation.

FIG. 5A shows the 35 amino acid low-density lipoprotein receptor-like class A domain of TADG-12 (SEQ ID NO: 13) aligned with other low-density lipoprotein receptor-like class A domain motifs from the serine protease TMPRSS2 (U75329, SEQ ID NO: 14), the complement subunit C8 (P07358, SEQ ID NO: 9), two LDLR-A domains of the glycoprotein GP300 (P98164, SEQ ID NOS: 11-12), and the serine protease matriptase (AF118224, SEQ ID NO: 10). TADG-12 has its highest similarity with the other serine proteases for which it is 54% similar to TMPRSS2 and 53% similar to matriptase. The highly conserved cysteine residues are shown in bold type.

FIG. 5B shows the scavenger receptor cysteine rich domain of TADG-12 (SEQ ID NO: 17) aligned with other domain family members including the human macrophage scavenger receptor (P21757, SEQ ID NO: 16), human enterokinase (P98073, SEQ ID NO: 19), bovine enterokinase (P21758, SEQ ID NO: 15), and the serine protease TMPRSS2 (SEQ ID NO: 18). Again, TADG-12 shows its highest similarity within this region to the protease TMPRSS2 at 43%.

FIG. 5C shows the protease domain of TADG-12 (SEQ ID NO: 23) in alignment with other human serine proteases including protease M (U62801, SEQ ID NO: 20), trypsinogen I (P07477, SEQ ID NO: 21), plasma kallikrein (P03952, SEQ ID NO: 22), hepsin (P05981, SEQ ID NO: 25), and TMPRSS2 (SEQ ID NO: 24). Cons represents the consensus sequence for each alignment.

FIG. 6 shows semi-quantitative PCR analysis that was performed for TADG-12 (upper panel) and TADG-12V (lower panel). The amplification of TADG-12 or TADG-12V was performed in parallel with PCR amplification of beta-tubulin product as an internal control. The TADG-12 transcript was found to be overexpressed in 41 of 55 carcinomas. The TADG-12V transcript was found to be overexpressed in 8 of 22 carcinomas examined. Note that the samples in the upper panel are not necessarily the same as the samples in the lower panel.

FIGS. 7A-7C show immunohistochemical staining of normal ovary and ovarian tumors which were performed using a polyclonal rabbit antibody developed to a TADG-12 specific peptide. No significant staining was detected in normal ovary (FIG. 7A). Strong positive staining was observed in 22 of 29 carcinomas examined. FIGS. 7B-7C represent a serous and mucinous carcinoma, respectively. Both show diffuse staining throughout the cytoplasm of tumor cells while stromal cells remain relatively unstained.

FIG. 8 is a model to demonstrate the progression of TADG-12 within a cellular context. In normal circumstances, the TADG-12 transcript is appropriately spliced and the resulting protein is capable of being expressed at the cell surface where the protease may be cleaved to an active form. The role of the remaining ligand binding domains has not yet been determined, but one can envision their potential to bind other molecules for activation, internalization or both. The TADG-12V transcript, which occurs in some tumors, may be the result of mutation and/or poor mRNA processing may be capable of producing a truncated form of TADG-12 that does not have a functional protease domain. In addition, this truncated product may present a novel epitope at the surface of tumor cells.

FIG. 9 shows the schematic for TADG-12 and splicing variants TADG-12D and TADG-12V™, transmembrane domain; LDLR, low-density lipoprotein receptor-like domain; SRCR, scavenger receptor cysteine rich domain.

FIG. 10 compares the protease domain of TADG-12 and splicing variants TADG-12D and TADG-12V.

FIG. 11 shows the expression of TADG-12D in normal and carcinoma tissue.

FIG. 12 shows the expression of TADG-12D in various carcinoma tissue.

DETAILED DESCRIPTION OF THE INVENTION

The present invention discloses a serine protease identified using a PCR based strategy. By Northern blot, the largest transcript for this gene is approximately 2.4 kb, and it was found to be expressed at high levels in ovarian tumors while found at minimal levels in all other tissues examined. The full-length cDNA encoding a novel multi-domain, cell-surface serine protease named TADG-12 was cloned. The TADG-12 cDNA is 2413 base pairs long (SEQ ID NO. 1) encoding a 454 amino acid protein (SEQ ID NO. 2). A variant form, TADG-12V (SEQ ID NO. 3), encodes a 294 amino acid protein (SEQ ID NO. 4). Another variant form, a 344 amino acid protein TADG-12D (SEQ ID NO. 154) was encoded by the DNA of SEQ ID NO. 155.

The TADG-12 protein contains a cytoplasmic domain, a type II transmembrane domain, a low-density lipoprotein receptor-like class A domain, an scavenger receptor cysteine rich domain and a serine protease domain. Using a semi-quantitative PCR analysis, it was shown that TADG-12 was overexpressed in a majority of tumors studied. Immunohistochemical staining corroborates that in some cases this protein is localized to the cell-surface of tumor cells and this suggests that TADG-12 has some extracellular proteolytic functions. The variant splicing form TADG-12V is present in 35% of the tumors studied. Another splicing variant TADG-12D is also highly expressed in tumor cells. These variant mRNAs would lead to truncated proteins that may provide unique peptide sequences on the surface of tumor cells.

The TADG-12 protein contains two extracellular domains which might confer unusual properties to this multidomain molecule. Although the precise role of low-density lipoprotein receptor-like class A domain function with regard to proteases remains unclear, this domain certainly has the capacity to bind calcium and other positively charged ligands (9-10). This may play an important role in the regulation of the protease or subsequent internalization of the molecule. The scavenger receptor cysteine rich domain was originally identified within the macrophage scavenger receptor and functionally described to bind lipoproteins. Not only are scavenger receptor cysteine rich domains capable of binding lipoproteins, but they may also bind to molecules as diverse as polynucleotides (11). More recent studies have identified members of this domain family in proteins with functions that vary from proteases to cell adhesion molecules involved in maturation of the immune system (12). In addition, TADG-12, like TMPRSS2 has only four of six cysteine residues conserved within its scavenger receptor cysteine rich domain. This difference may allow for different structural features of these domains that confer unusual ligand binding properties.

At this time, only the function of the CD6 encoded scavenger receptor cysteine rich domain is well documented. In the case of CD6, the scavenger receptor cysteine rich domain binds to the cell adhesion molecule ALCAM (11). This mediation of cell adhesion is a useful starting point for future research on newly identified scavenger receptor cysteine rich domains; however, the possibility of multiple functions for this domain can not be overlooked. Scavenger receptor cysteine rich domains are certainly capable of cell adhesion type interactions, but their capacity to bind other types of ligands should be considered.

FIG. 8 presents a working model of TADG-12 with the information disclosed in the present invention. Two transcripts are produced which lead to the production of either TADG-12 or the truncated TADG-12V proteins. Either of these proteins is potentially targeted to the cell surface. TADG-12 is capable of becoming an activated serine protease while TADG-12V is a truncated protein product that if at the cell surface may represent a tumor specific epitope.

The problem with treatment of cancer such as ovarian cancer remains the inability to diagnose the disease at an early stage. Identifying genes that are expressed early in the disease process such as proteases that are essential for tumor cell growth (13) is an important step toward improving treatment. The availability of the TADG-12 and its splicing variants opens the way for a number studies that can lead to various applications. For example, the TADG-12 and its variant genes or proteins can be used as a diagnostic or therapeutic target in ovarian and other carcinomas. The extracellular ligand binding domains are natural targets for drug delivery systems, and aberrant peptide associated with the TADG-12 protein or its variants may provide excellent targets for immune stimulation. Alternatively, inhibition of enzymes such as TADG-12 may be an effective means for slowing progression of various cancer and improving the quality of patient life.

As described herein, the invention provides a number of diagnostic advantages and uses. For example, the TADG-12 protein or its variants disclosed in the present invention are useful in diagnosing cancer in different tissues since these proteins are highly expressed in tumor cells. Antibodies or antigen-binding fragments thereof which bind to an epitope specific for a TADG-12 protein or its variant proteins are useful in a method of detecting TADG-12 or its variants in a biological sample for diagnosis of cancerous or neoplastic transformation. This method includes the steps of obtaining a biological sample, e.g., cells, blood, plasma, tissue, etc., from a patient suspected of having cancer, contacting the sample with a labeled antibody, e.g., radioactively tagged antibody, specific for TADG-12 or its splicing variants, and detecting the proteins using standard immunoassay techniques. Likewise, a standard Northern blot assay can be used to ascertain the relative amounts of TADG-12 mRNA in a cell or tissue obtained from a patient suspected of having cancer in accordance with conventional Northern hybridization techniques known to those of ordinary skill in the art.

In accordance with the present invention there may be employed conventional molecular biology, microbiology, and recombinant DNA techniques within the skill of the art. Such techniques are explained fully in the literature. See, e.g., Maniatis, Fritsch & Sambrook, “Molecular Cloning: A Laboratory Manual (1982); “DNA Cloning: A Practical Approach,” Volumes I and II (D. N. Glover ed. 1985); “Oligonucleotide Synthesis” (M. J. Gait ed. 1984); “Nucleic Acid Hybridization” [B. D. Hames & S. J. Higgins eds. (1985)]; “Transcription and Translation” [B. D. Hames & S. J. Higgins eds. (1984)]; “Animal Cell Culture” [R. I. Freshney, ed. (1986)]; “Immobilized Cells And Enzymes” [IRL Press, (1986)]; B. Perbal, “A Practical Guide To Molecular Cloning” (1984).

As used herein, “TADG-12 protein” refers to full length TADG-12 protein (SEQ ID NO. 2) as well as splicing variants or truncated forms of TADG-12. These splicing variants or truncated forms are derived from TADG-12 by removing or deleting amino acids from SEQ ID NO. 2 with or without inclusion of addition amino acids. Examples of TADG-12 splicing variants include, but are not limited to, TADG-12V and TADG-12D.

The present invention is directed to DNA fragments encoding TADG-12 and its splicing variants, such as TADG-12V and TADG-12D. The claimed DNAs include those having different codon sequences for TADG-12 due to the degeneracy of the genetic code. The protein encoded by the DNA of this invention may share at least 80% sequence identity, preferably 85%, more preferably 90%, and most preferably 95%, with the amino acids listed in SEQ ID NO: 2 or SEQ ID NO: 4. More preferably, the DNA includes the coding sequence of SEQ ID NO. 1, or a degenerate variant of such a sequence. Preferably, the DNA has the sequence shown in SEQ ID NOs. 1, 3 or 155, and the DNA encodes a TADG-12 protein having the amino acid sequence shown in SEQ ID NOs. 2, 4 or 154.

This invention includes a substantially pure DNA comprising a sequence of at least 15 consecutive nucleotides, preferably 20, more preferably 30, even more preferably 50, and most preferably all, of the region from nucleotides 1 to 2413 of the nucleotides listed in SEQ ID NO: 1, or of the region from nucleotides 1 to 2544 of the nucleotides listed in SEQ ID NO: 3. The present invention also comprises antisense oligonucleotides directed against DNAs encoding TADG-12 (SEQ ID NO. 1) or its splicing variants, e.g. SEQ ID NOS. 3 or 155. Given the teachings of the present invention, a person having ordinary skill in this art would readily be able to develop antisense oligonucleotides directed against these DNA sequences.

The present invention also provides a vector comprising a DNA sequence which encodes a human TADG-12 protein or its splicing variants. The vector comprising in operable linkage an origin of replication, a promoter and a DNA sequence coding for said protein is capable of replication in a host cell. Preferably, the vector contains a portion of the DNA sequences shown in SEQ ID NOS: 1, 3 or 155. Methods which are well known to those skilled in the art can be used to construct expression vectors containing appropriate transcriptional and translational control signals. See for example, the techniques described in Sambrook et al., 1989, Molecular Cloning: A Laboratory Manual (2nd Ed.), Cold Spring Harbor Press, N.Y. Vectors of the invention include, but are not limited to, plasmid vectors and viral vectors. Preferred viral vectors of the invention are those derived from retroviruses, adenovirus, adeno-associated virus, SV40 virus, or herpes viruses. Representative host cells include bacterial cells, yeast cells, mammalian cells and insect cells.

The present invention is also directed to an isolated and purified TADG-12 protein encoded for by the DNA claimed herein. Preferably, the isolated and purified TADG-12 protein has the amino acid sequence shown in SEQ ID NOS: 2, 4 or 154.

By a “substantially pure protein” is meant a protein which has been separated from at least some of those components which naturally accompany it. A substantially pure TADG-12 protein may be obtained, for example, by extraction from a natural source, by expression of a recombinant nucleic acid encoding a TADG-12 polypeptide, or by chemically synthesizing the protein.

Included in this invention are TADG-12 proteins which are encoded at least in part by portions of SEQ ID NO: 1 or SEQ ID NO: 3, e.g., products of alternative mRNA splicing or alternative protein processing events, or in which a section of TADG-12 sequence has been deleted. The fragment, or the intact TADG-12 polypeptide, may be covalently linked to another polypeptide, e.g. which acts as a label, a ligand or a means to increase antigenicity.

In addition to substantially full-length proteins, the invention also includes fragments, e.g., antigenic fragments, of the TADG-12 protein. As used herein, “fragment,” as applied to a polypeptide, will ordinarily be at least 10 residues, more typically at least 20 residues, and preferably at least 30, e.g., 50, residues in length, but less than the entire, intact sequence. Fragments of the TADG-12 protein can be generated by methods known to those skilled in the art, e.g., by enzymatic digestion of naturally occurring or recombinant TADG-12 protein, by recombinant DNA techniques using an expression vector that encodes a defined fragment of TADG-12, or by chemical synthesis. The ability of a candidate fragment to exhibit a characteristic of TADG-12, e.g., binding to an antibody specific for TADG-12, can be assessed by methods described herein. Purified TADG-12 or antigenic fragments of TADG-12 can be used to generate new antibodies or to test existing antibodies, e.g., as positive controls in a diagnostic assay, by employing standard protocols known to those skilled in the art.

The invention also includes a polyclonal or monoclonal antibody which specifically binds to TADG-12 protein or its variants. Standard protocols for monoclonal and polyclonal antibody production known to those skilled in this art are employed. The invention encompasses not only intact monoclonal antibody, but also immunologically-active antibody fragment, e.g., a Fab or (Fab).sub.2 fragment, an engineered single chain Fv molecule, or a chimeric antibody consists of regions derived from different species, e.g., the antigen binding sites are of murine origin and the remaining portions of the antibody are of human origin.

The anti-TADG-12 antibody, or a fragment thereof, may be linked to a toxin or to a detectable label, e.g. a radioactive label, non-radioactive isotopic label, fluorescent label, chemiluminescent label, paramagnetic label, enzyme label, or calorimetric label. One of ordinary skill in the art would know of suitable labels which may be employed in accordance with the present invention. The binding of these labels to antibodies or fragments thereof can be accomplished using standard techniques commonly known to those of ordinary skill in the art. Typical techniques are described by Kennedy et al., (1976) Clin. Chim. Acta 70:1-31; and Schurs et al., (1977) Clin. Chim. Acta 81:1-40.

In another embodiment of the present invention, there are provided a methods for detecting malignant hyperplasia by detecting a TADG-12 protein or its variants, or mRNA encoding these proteins in a biological sample. Preferably, the biological sample is selected from the group consisting of blood, urine, saliva, tears, interstitial fluid, ascites fluid, tumor tissue biopsy and circulating tumor cells. TADG-12 protein or mRNA can be detected by Northern blot, Western blot, PCR, dot blot, ELISA sandwich assay, radioimmunoassay, DNA array chips or flow cytometry. Such methods can be used for detecting ovarian cancer, breast cancer, lung cancer, colon cancer, prostate cancer and other cancers in which TADG-12 is overexpressed.

In still another embodiment of the present invention, there is provided a method of inhibiting expression of endogenous TADG-12 mRNA in a cell by introducing a vector comprising a DNA fragment of TADG-12 in opposite orientation operably linked to elements necessary for expression. As a result, the vector produces TADG-12 antisense mRNA in the cell, which hybridizes to endogenous TADG-12 mRNA, thereby inhibiting expression of endogenous TADG-12 mRNA.

In still yet another embodiment of the present invention, there is provided a method of inhibiting expression of a TADG-12 protein by introducing an antibody directed against a TADG-12 protein or fragment thereof. As a result, the binding of the antibody to the TADG-12 protein or fragment thereof inhibits the expression of the TADG-12 protein.

In another embodiment of the present invention, TADG-12 gene products including the truncated form can be used for targeted therapy. Specifically, a compound having a targeting moiety specific for a TADG-12 protein or its variants and a therapeutic moiety is administered to an individual in need of such treatment. Preferably, the targeting moiety is selected from the group consisting of a n antibody directed against a TADG-12 protein or its variants and a ligand or ligand binding domain that binds a TADG-12 protein. For example, the TADG-12 protein can have an amino acid sequence shown in SEQ ID NOS: 2, 4 or 154. Still preferably, the therapeutic moiety is selected from the group consisting of a radioisotope, a toxin, a chemotherapeutic agent, an immune stimulant and a cytotoxic agent. Such a method can be used for treating an individual having, for example, ovarian cancer, lung cancer, prostate cancer, or colon cancer.

In yet another embodiment of the present invention, there is provided a method of vaccinating, or producing an immune response in, an individual against TADG-12 by inoculating the individual with a TADG-12 protein or fragment thereof. Specifically, the TADG-12 protein or fragment thereof lacks TADG-12 activity, and the inoculation elicits an immune response in the individual, thereby vaccinating the individual against TADG-12. Preferably, the individual has a cancer, is suspected of having a cancer or is at risk of getting a cancer. Still preferably, TADG-12 protein has an amino acid sequence shown in SEQ ID NOS: 2, 4 or 154, while TADG-12 fragment has a sequence shown in SEQ ID NO: 8, or is a 9-residue fragment up to a 20-residue fragment. Examples of 9-residue fragment are shown in SEQ ID NOS: 35, 36, 55, 56, 83, 84, 97, 98, 119, 120, 122, 123 and 136.

In still yet another embodiment of the present invention, there is provided an immunogenic composition, comprising a n immunogenic fragment of a TADG-12 protein and an appropriate adjuvant. Preferably, the immunogenic fragment of the TADG-12 protein has a sequence shown in SEQ ID NO: 8, or is a 9-residue fragment up to a 20-residue fragment. Examples of 9-residue fragment are shown in SEQ ID NOS: 35, 36, 55, 56, 83, 84, 97, 98, 119, 120, 122, 123 and 136.

The following examples are given for the purpose of illustrating various embodiments of the invention and are not meant to limit the present invention in any fashion.

EXAMPLE 1 Tissue Collection and Storage

Upon patient hysterectomy, bilateral salpingo-oophorectomy, or surgical removal of neoplastic tissue, the specimen is retrieved and placed on ice. The specimen was then taken to the resident pathologist for isolation and identification of specific tissue samples. Finally, the sample was frozen in liquid nitrogen, logged into the laboratory record and stored at −80° C. Additional specimens were frequently obtained from the Cooperative Human Tissue Network (CHTN). These samples were prepared by the Cooperative Human Tissue Network and shipped on dry ice. Upon arrival, these specimens were logged into the laboratory record and stored at −80° C.

EXAMPLE 2 mRNA Extraction and cDNA Synthesis

Sixty-nine ovarian tumors (4 benign tumors, 10 low malignant potential tumors and 55 carcinomas) and 10 normal ovaries were obtained from surgical specimens and frozen in liquid nitrogen. The human ovarian carcinoma cell lines SW 626 and Caov 3, the human breast carcinoma cell lines MDA-MB-231 and MDA-MB-435S were purchased from the American Type Culture Collection (Rockville, Md.). Cells were cultured to sub-confluency in Dulbecco's modified Eagle's medium, supplemented with 10% (v/v) fetal bovine serum and antibiotics.

Extraction of mRNA and cDNA synthesis were carried out by the methods described previously (14-16). mRNA was isolated by using a RiboSep mRNA isolation kit (Becton Dickinson Labware). In this procedure, poly A+ mRNA was isolated directly from the tissue lysate using the affinity chromatography media oligo(dT) cellulose. cDNA was synthesized with 5.0 μg of mRNA by random hexamer priming using 1st strand cDNA synthesis kit (CLONTECH).

EXAMPLE 3 PCR with Redundant Primers and Cloning of TADG-12 cDNA

Redundant primers, forward 5′-TGGGTIGTIACIGCIG CICA(CT)TG-3′ (SEQ ID NO: 26) and reverse 5′-A(AG)IA(AG)IGCIATITCI TTICC-3′ (SEQ ID NO: 27), for the consensus sequences of amino acids surrounding the catalytic triad for serine proteases were used to compare the PCR products from normal and carcinoma cDNAs. The appropriate bands were ligated into Promega T-vector plasmid and the ligation product was used to transform JM109 cells (Promega) grown on selection media. After selection of individual colonies, they were cultured and plasmid DNA was isolated by means of the Wizard miniprep DNA purification system (Promega). Nucleotide sequencing was performed using PRISM Ready Reaction Dye Deoxy terminator cycle sequencing kit (Applied Biosystems). Applied Biosystems Model 373A DNA sequencing system was used for direct cDNA sequence determination.

The original TADG-12 subclone was randomly labeled and used as a probe to screen an ovarian tumor cDNA library by standard hybridization techniques (15,17). The library was constructed in λZAP using mRNA isolated from the tumor cells of a stage III/grade III ovarian adenocarcinoma patient. Three overlapping clones were obtained which spanned 2315 nucleotides. The final 99 nucleotides encoding the most 3′ sequence including the poly A tail was identified by homology with clones available in the GenBank EST database.

EXAMPLE 4 Quantitative PCR

The mRNA overexpression of TADG-12 was determined using a quantitative PCR. Quantitative PCR was performed according to the procedure as previously reported (16). Oligonucleotide primers were used for: TADG-12, forward 5′-GAAACATGTCCTTGCTCTCG-3′ (SEQ ID NO: 28) and reverse 5′-ACTAACTTCCACAGCCTCCT-3′ (SEQ ID NO: 29); the variant TADG-12, forward 5′-TCCAGGTGGGTCAGTTTCC-3′ (SEQ ID NO: 30), reverse 5′-CTCTTGGCTTGTACTTGCT-3′ (SEQ ID NO: 31); beta-tubulin, forward 5′-CGCATCAACGTGTACTACAA-3′ (SEQ ID NO: 32) and reverse 5′-TACGAGCTGGTGGACTGAGA-3′ (SEQ ID NO: 33). Beta-tubulin was utilized as an internal control. The PCR reaction mixture consists of cDNA derived from 50 ng of mRNA, 5 pmol of sense and antisense primers for both the TADG-12 gene and the beta-tubulin gene, 200 μmol of dNTPs, 5 μCi of α-³²PdCTP and 0.25 unit of Taq DNA polymerase with reaction buffer (Promega) in a final volume of 25 μl. The target sequences were amplified in parallel with the beta-tubulin gene. Thirty cycles of PCR were carried out in a Thermal Cycler (Perkin-Elmer Cetus). Each cycle of PCR included 30 seconds of denaturation at 94% C, 30 seconds of annealing at 60% C and 30 seconds of extension at 72% C. The PCR products were separated on 2% agarose gels and the radioactivity of each PCR product was determined by using a Phospho Imager (Molecular Dynamics). The present study used the expression ratio (TADG-12/beta-tubulin) as measured by phosphoimager to evaluate gene expression and defined the value at mean +2SD of normal ovary as the cut-off value to determine overexpression. The student's t test was used for comparison of the mean values of normal ovary and tumors.

EXAMPLE 5 Sequencing of TADG-12

Utilizing a plasmid specific primer near the cloning site, sequencing reactions were carried out using PRISM™ Ready Reaction Dye Deoxy™ terminators (Applied Biosystems cat no. 401384) according to the manufacturer's instructions. Residual dye terminators were removed from the completed sequencing reaction using a CENTRI-SEP spin column (Princeton Separation cat. no CS-901). An Applied Biosystems Model 373A DNA Sequencing System was available and was used for sequence analysis.

EXAMPLE 6 Antibody Production

Polyclonal rabbit antibodies were generated by immunization of white New Zealand rabbits with a poly-lysine linked multiple antigen peptide derived from the TADG-12 carboxy-terminal protein sequence NH2-WIHEQMERDLKT-COOH (WIHEQMERDLKT, SEQ ID NO: 34). This peptide is present in full length TADG-12, but not TADG-12V. Rabbits were immunized with approximately 100 μg of peptide emulsified in Ribi adjuvant. Subsequent boost immunizations were carried out at 3 and 6 weeks and rabbit serum was isolated 10 days after the boost inoculations. Sera were tested by dot blot analysis to determine affinity for the TADG-12 specific peptide. Rabbit pre-immune serum was used as a negative control.

EXAMPLE 7 Northern Blot Analysis

10 μg of mRNA were loaded onto a 1% formaldehyde-agarose gel, electrophoresed and blotted on a Hybond-N+ nylon membrane (Amersham). .sup.32P-labeled cDNA probes were made by Prime-a-Gene Labeling System (Promega). The PCR products amplified by the same primers as above were used for probes. The blots were prehybridized for 30 min and hybridized for 60 min at 68% C with ³²P-labeled cDNA probe in ExpressHyb Hybridization Solution (CLONTECH). Control hybridization to determine relative gel loading was performed with the beta-tubulin probe.

Normal human tissues; spleen, thymus, prostate, testis, ovary, small intestine, colon and peripheral blood leukocyte, and normal human fetal tissues; brain, lung, liver and kidney (Human Multiple Tissue Northern Blot; CLONTECH) were also examined by same hybridization procedure.

EXAMPLE 8 Immunohistochemistry

Immunohistochemical staining was performed using a Vectastain Elite ABC Kit (Vector). Formalin fixed and paraffin embedded specimens were routinely deparaffinized and processed using microwave heat treatment in 0.01 M sodium citrate buffer (pH 6.0). The specimens were incubated with normal goat serum in a moist chamber for 30 minutes. TADG-12 peptide antibody was allowed to incubate with the specimens in a moisture chamber for 1 hour. Excess antibody was washed away with phosphate buffered saline. After incubation with biotinylated anti-rabbit IgG for 30 minutes, the sections were then incubated with ABC reagent (Vector) for 30 minutes. The final products were visualized using the AEC substrate system (DAKO) and sections were counterstained with hematoxylin before mounting. Negative controls were performed by using normal serum instead of the primary antibody.

EXAMPLE 9 Isolation of Catalytic Domain Subclones of TADG-12 and TADG-12 Variant

To identify serine proteases that are expressed in ovarian tumors, redundant PCR primers designed to the conserved regions of the catalytic triad of these enzymes were employed. A sense primer designed to the region surrounding the conserved histidine and an anti-sense primer designed to the region surrounding the conserved aspartate were used in PCR reactions with either normal ovary or ovarian tumor cDNA as template. In the reaction with ovarian tumor cDNA, a strong product band of the expected size of approximately 180 bp was observed as well as an unexpected PCR product of approximately 300 bp which showed strong expression in some ovarian tumor cDNA's (FIG. 1A). Both of these PCR products were subcloned and sequenced. The sequence of the subclones from the 180 bp band (SEQ ID NO: 5) was found to be homologous to the sequence identified in the larger, unexpected band (SEQ ID NO: 7) except that the larger band had an additional insert of 133 nucleotides (FIG. 1B). The smaller product of the appropriate size encoded for a protein sequence (SEQ ID NO: 6) homologous to other known proteases while the sequence with the insertion (SEQ ID NO: 8) encoded for a frame shift from the serine protease catalytic domain and a subsequent premature translational stop codon. TADG-12 variants from four individual tumors were also subcloned and sequenced. It was found that the sequence and insert to be identical. The genomic sequences for these cDNA derived clones were amplified by PCR, examined and found to contain potential AG/GT splice sites that would allow for the variant transcript production (data not shown).

EXAMPLE 10 Northern Blot Analysis of TADG-12 Expression

To examine transcript size and tissue distribution, the catalytic domain subclone was randomly labeled and used to probe Northern blots representing normal ovarian tissue, ovarian tumors and the cancer cell lines SW626, CAOV3, HeLa, MD-MBA-435S and MD-MBA-231 (FIG. 2). Three transcripts of 2.4, 1.6 and 0.7 kilobases were observed. In blots of normal and ovary tumor the smallest transcript size 0.7 kb was lowly expressed in normal ovary while all transcripts (2.4, 1.6 and 0.7 kb) were abundantly present in serous carcinoma. In addition, Northern blots representing the normal human tissues spleen, thymus, prostate, testis, ovary, small intestine, colon and peripheral blood leukocyte, and normal human fetal tissues of brain, lung, liver and kidney were examined. The same three transcripts were found to be expressed weakly in all of these tissues (data not shown). A human beta-tubulin specific probe was utilized as a control for relative sample loading. In addition, an RNA dot blot was probed representing 50 human tissues and determined that this clone is weakly expressed in all tissues represented (FIG. 3). It was found most prominently in heart, with intermediate levels in putamen, amygdala, kidney, liver, small intestine, skeletal muscle, and adrenal gland.

EXAMPLE 11 Sequencing and Characterization of TADG-12

An ovarian tumor cDNA library constructed in .lambda.ZAP was screened by standard hybridization techniques using the catalytic domain subclone as a probe. Two clones that overlapped with the probe were identified and sequenced and found to represent 2316 nucleotides. The 97 nucleotides at the 3′ end of the transcript including the poly-adenylation signal and the poly (A) tail were identified by homology with clones available in GenBank's EST database. This brought the total size of the transcript to 2413 bases (SEQ ID NO: 1, FIGS. 4A-4C). Subsequent screening of GenBank's Genomic Database revealed that TADG-12 is homologous to a cosmid from chromosome 17. This cosmid has the accession number AC015555.

The identified cDNA includes an open reading frame that would produce a predicted protein of 454 amino acids (SEQ ID NO: 2), named Tumor Associated Differentially-Expressed Gene 12 (TADG-12). The sequence has been submitted to the GenBank database and granted the Accession No. AF201380. Using homology alignment programs, this protein contains several domains including an amino-terminal cytoplasmic domain, a potential Type II transmembrane domain followed by a low-density lipoprotein receptor-like class A domain (LDLR-A), a scavenger receptor cysteine rich domain (SRCR), and an extracellular serine protease domain.

As predicted by the PRED program, TADG-12 contains a highly hydrophobic stretch of amino acids that could serve as a potential transmembrane domain, which would retain the amino terminus of the protein within the cytoplasm and expose the ligand binding domains and protease domain to the extracellular space. This general structure is consistent with other known transmembrane proteases including hepsin (18), and TMPRSS2 (19), and TADG-12 is particularly similar in structure to the TMPRSS2 protease.

The low-density lipoprotein receptor-like class A domain of TADG-12 is represented by the sequence from amino acid 74 to 108 (SEQ ID NO: 13). The low-density lipoprotein receptor-like class A domain was originally identified within the LDL Receptor (20) as a series of repeated sequences of approximately 40 amino acids, which contained 6 invariant cysteine residues and highly conserved aspartate and glutamate residues. Since that initial identification, a host of other genes have been identified which contain motifs homologous to this domain (21). Several proteases have been identified which contain LDLR-A motifs including matriptase, TMPRSS2 and several complement components. A comparison of TADG-12 with other known low-density lipoprotein receptor-like class A domains is shown in FIG. 5A. The similarity of these sequences range from 44 to 54% of similar or identical amino acids.

In addition to the low-density lipoprotein receptor-like class A domain, TADG-12 contains another extracellular ligand binding domain with homology to the group A scavenger receptor cysteine rich domain family. This family of protein domains typically is defined by the conservation of 6 cysteine resides within a sequence of approximately 100 amino acids (11). The scavenger receptor cysteine rich domain of TADG-12 is encoded by amino acids 109 to 206 (SEQ ID NO: 17), and this domain was aligned with other scavenger receptor cysteine rich domains and found to have between 36 and 43% similarity (FIG. 5B). However, TADG-12 only has 4 of the 6 conserved cysteine residues. This is similar to the scavenger receptor cysteine rich domain found in the protease TMPRSS2.

The TADG-12 protein also includes a serine protease domain of the trypsin family of proteases. An alignment of the catalytic domain of TADG-12 with other known proteases is shown in FIG. 5C. The similarity among these sequence ranges from 48 to 55%, and TADG-12 is most similar to the serine protease TMPRSS2 which also contains a transmembrane domain, low-density lipoprotein receptor-like class A domain and an scavenger receptor cysteine rich domain. There is a conserved amino acid motif (RIVGG) downstream from the scavenger receptor cysteine rich domain that is a potential cleavage/activation site common to many serine proteases of this family (22). This suggests that TADG-12 is trafficked to the cell surface where the ligand binding domains are capable of interacting with extracellular molecules and the protease domain is potentially activated. TADG-12 also contains conserved cysteine residues, amino acids 208 and 243, which in other proteases form a disulfide bond capable of linking the activated protease to the other extracellular domains.

EXAMPLE 12 Quantitative PCR Characterization of TADG-12V

The original TADG-12 subclone was identified as highly expressed in the initial redundant-primer PCR experiment. The TADG-12 variant form (TADG-12V) with the insertion of 133 bp was also easily detected in the initial experiment. To identify the frequency of this expression and whether or not the expression level between normal ovary and ovarian tumors was different, a previously authenticated semi-quantitative PCR technique was employed (16). The PCR analysis co-amplified a product for beta-tubulin with either a product specific to TADG-12 or TADG-12V in the presence of a radiolabelled nucleotide. The products were separated by agarose gel electrophoresis and a phosphoimager was used to quantitate the relative abundance of each PCR product.

Examples of these PCR amplification products are shown for both TADG-12 and TADG-12V in FIG. 6. Normal expression was defined as the mean ratio of TADG-12 (or TADG-12V) to beta-tubulin +/−2SD as examined in normal ovarian samples. For tumor samples, overexpression was defined as >2SD from the normal TADG-12/beta-tubulin or TADG-12V/beta-tubulin ratio. The results are summarized in Tables 1 and 2. TADG-12 was found to be overexpressed in 41 of 55 carcinomas examined while the variant form was present at aberrantly high levels in 8 of 22 carcinomas. As determined by the student's t test, these differences were statistically significant (p<0.05). LMP is a low malignant potential tumor.

TABLE 1 Frequency of Overexpression of TADG-12 in Ovarian Carcinoma Histology Type TADG-12 (%) Normal 0/16 (0%) LMP-Serous 3/6 (50%) LMP-Mucinous 0/4 (0%) Serous Carcinoma 23/29 (79%) Mucinous Carcinoma 7/12 (58%) Endometrioid Carcinoma 8/8 (100%) Clear Cell Carcinoma 3/6 (50%) Benign Tumors 3/4 (75%)

TABLE 2 Frequency of Overexpression of TADG-12V in Ovarian Carcinoma Histology Type TADG-12V (%) Normal 0/10 (0%) LMP-Serous 0/5 (0%) LMP-Mucinous 0/3 (0%) Serous Carcinoma 4/14 (29%) Mucinous Carcinoma 3/5 (60%) Endometrioid Carcinoma 1/3 (33%) Clear Cell Carcinoma N/D

EXAMPLE 13 Immunohistochemical Analysis of TADG-12 in Ovarian Tumor Cells

[In order to examine the TADG-12 protein, polyclonal rabbit anti-sera to a peptide located in the carboxy-terminal amino acid sequence was developed. These antibodies were used to examine the expression level of the TADG-12 protein and its localization within normal ovary and ovarian tumor cells by immuno-localization. No staining was observed in normal ovarian tissues (FIG. 7A) while significant staining was observed in 22 of 29 tumors studied. Representative tumor samples are shown in FIGS. 7B-7C. It should be noted that TADG-12 is found in a diffuse pattern throughout the cytoplasm indicative of a protein in a trafficking pathway. TADG-12 is also found at the cell surface in these tumor samples as expected. It should be noted that the antibody developed and used for immunohistochemical analysis would not detect the TADG-12V truncated protein.

The results of the immunohistochemical staining are summarized in Table 3. 22 of 29 ovarian tumors showed positive staining of TADG-12, whereas normal ovarian surface epithelium showed no expression of the TADG-12 antigen. 8 of 10 serous adenocarcinomas, 8 of 8 mucinous adenocarcinomas, 1 of 2 clear cell carcinomas, and 4 of 6 endometroid carcinomas showed positive staining.

EXAMPLE 14 Splicing Variant TADG-12D

The TADG-12V protein is generated from a splicing error that occurred downstream from the sentinel histidine (FIG. 9). The net outcome of this splicing error is the inclusion of intron sequence downstream from the histidine that has a stop codon 33 amino acids downstream.

A second splicing variant of TADG-12 was identified. This new splicing error occurs downstream from the TADG-12V variant near the sentinel aspartic acid of the serine protease. It also results in the inclusion of intron sequence and a new stop codon. The result is a variant that includes unique amino acid sequence (TADG-12D, FIG. 10). TADG-12D is highly expressed in ovarian carcinoma, but not in normal ovary (FIG. 11). TADG-12D is also expressed in breast and lung carcinoma (FIG. 12). The presence of unique peptide sequence in tumor cells offers a specific target and/or diagnostic marker for cancer treatment and monitoring.

TABLE 3 Case Stage Histology Grade LN* TADG12 Prognosis 1 Normal ovary 0− 2 Normal ovary 0− 3 Normal ovary 0− 4 Mucinous B ND 0− Alive 5 Mucinous B ND 1+ Alive 6 1a Serous LMP G1 ND 1+ Alive 7 1a Mucinous LMP G1 ND 1+ Alive 8 1a Mucinous CA G1 ND 1+ Alive 9 1a Mucinous CA G2 ND 1+ Alive 10 1a Endometrioid CA G1 ND 0− Alive 11 1c Serous CA G1 N 1+ Alive 12 1c Mucinous CA G1 N 1+ Alive 13 1c Mucinous CA G1 N 2+ Alive 14 1c Clear cell CA G2 N 0− Alive 15 1c Clear cell CA G2 N 0− Alive 16 2c Serous CA G3 N 2+ Alive 17 3a Mucinous CA G2 N 2+ Alive 18 3b Serous CA G1 ND 1+ Alive 19 3c Serous CA G1 N 0− Dead 20 3c Serous CA G3 P 1+ Alive 21 3c Serous CA G2 P 2+ Alive 22 3c Serous CA G1 P 2+ Unknown 23 3c Serous CA G3 ND 2+ Alive 24 3c Serous CA G2 N 0− Dead 25 3c Mucinous CA G1 P 2+ Dead 26 3c Mucinous CA G2 ND 1+ Unknown 27 3c Mucinous CA G2 N 1+ Alive 28 3c Endometrioid CA G1 P 1+ Dead 29 3c Endometrioid CA G2 N 0− Alive 30 3c Endometrioid CA G2 P 1+ Dead 31 3c Endometrioid CA G3 P 1+ Alive 32 3c Clear Cell CA G3 P 2+ Dead LN* = Lymph Node: B = Benign; N = Negative; P = Positive; ND = Not Done

EXAMPLE 15 Peptide Ranking

For vaccine or immune stimulation, individual 9-mers to 11-mers of the TADG-12 protein were examined to rank the binding of individual peptides to the top 8 haplotypes in the general population. The computer program used for this analysis can be found on the internet. Table 4 shows the peptide ranking based upon the predicted half-life of each peptide's binding to a particular HLA allele. A larger half-life indicates a stronger association with that peptide and the particular HLA molecule. The TADG-12 peptides that strongly bind to an HLA allele are putative immunogens, and are used to innoculate an individual against TADG-12.

TABLE 4 TADG-12 peptide ranking HLA Type Predicted SEQ & Ranking Start Peptide Dissociation_(1/2) ID NO: HLA A0201  1 40 ILSLLPFEV 685.783 35  2 144 AQLGFPSYV 545.316 36  3 225 LLSQWPWQA 63.342 37  4 252 WIITAAHCV 43.992 38  5 356 VLNHAAVPL 36.316 39  6 176 LLPDDKVTA 34.627 40  7 13 FSFRSLFGL 31.661 41  8 151 YVSSDNLRV 27.995 42  9 436 RVTSFLDWI 21.502 43 10 234 SLQFQGYHL 21.362 44 11 181 KVTALHHSV 21.300 45 12 183 TALHHSVYV 19.658 46 13 411 RLWKLVGAT 18.494 47 14 60 LILALAIGL 18.476 48 15 227 SQWPWQASL 17.977 49 16 301 RLGNDIALM 11.426 50 17 307 ALMKLAGPL 10.275 51 18 262 DLYLPKSWT 9.837 52 19 416 LVGATSFGI 9.001 53 20 54 SLGIIALIL 8.759 54 HLA A0205  1 218 IVGGNMSLL 47.600 55  2 60 LILALAIGL 35.700 48  3 35 AVAAQILSL 28.000 56  4 307 ALMKLAGPL 21.000 51  5 271 IQVGLVSLL 19.040 57  6 397 CQGDSGGPL 16.800 58  7 227 SQWPWQASL 16.800 49  8 270 TIQVGLVSL 14.000 59  9 56 GIIALILAL 14.000 60 10 110 RVGGQNAVL 14.000 61 11 181 KVTALHHSV 12.000 45 12 151 YVSSDNLRV 12.000 42 13 356 VLNHAAVPL 11.900 39 14 144 AQLGFPSYV 9.600 36 15 13 FSFRSLFGL 7.560 41 16 54 SLGIIALIL 7.000 54 17 234 SLQFQGYHL 7.000 44 18 217 RIVGGNMSL 7.000 62 19 411 RLWKLVGAT 6.000 47 20 252 WIITAAHCV 6.000 38 HLA A1  1 130 CSDDWKGHY 37.500 63  2 8 AVEAPFSFR 9.000 64  3 328 NSEENFPDG 2.700 65  4 3 ENDPPAVEA 2.500 66  5 98 DCKDGEDEY 2.500 67  6 346 ATEDGGDAS 2.250 68  7 360 AAVPLISNK 2.000 69  8 153 SSDNLRVSS 1.500 70  9 182 VTALHHSVY 1.250 71 10 143 CAQLGFPSY 1.000 72 11 259 CVYDLYLPK 1.000 73 12 369 ICNHRDVYG 1.000 74 13 278 LLDNPAPSH 1.000 75 14 426 CAEVNKPGV 1.000 76 15 32 DADAVAAQI 1.000 77 16 406 VCQERRLWK 1.000 78 17 329 SEENFPDGK 0.900 79 18 303 GNDIALMKL 0.625 80 19 127 KTMCSDDWK 0.500 81 20 440 FLDWIHEQM 0.500 82 HLA A24  1 433 VYTRVTSFL 280.000 83  2 263 LYLPKSWTI 90.000 84  3 169 EFVSIDHLL 42.000 85  4 217 RIVGGNMSL 12.000 62  5 296 KYKPKRLGN 12.000 86  6 16 RSLFGLDDL 12.000 87  7 267 KSWTIQVGL 11.200 88  8 81 RSSFKCIEL 8.800 89  9 375 VYGGIISPS 8.000 90 10 110 RVGGQNAVL 8.000 91 11 189 VYVREGCAS 7.500 92 12 60 LILALAIGL 7.200 48 13 165 QFREEFVSI 7.200 93 14 271 IQVGLVSLL 7.200 57 15 56 GIIALILAL 7.200 60 16 10 EAPFSFRSL 7.200 94 17 307 ALMKLAGPL 7.200 51 18 407 CQERRLWKL 6.600 95 19 356 VLNHAAVPL 6.000 39 20 381 SPSMLCAGY 6.000 96 HLA B7  1 375 VYGGIISPS 200.000 97  2 381 SPSMLCAGY 80.000 98  3 362 VPLISNKIC 80.000 99  4 35 AVAAQILSL 60.000 56  5 373 RDVYGGIIS 40.000 100  6 307 ALMKLAGPL 36.000 51  7 283 APSHLVEKI 24.000 101  8 177 LPDDKVTAL 24.000 102  9 47 EVFSQSSSL 20.000 103 10 110 RVGGQNAVL 20.000 91 11 218 IVGGNMSLL 20.000 55 12 36 VAAQILSLL 12.000 104 13 255 TAAHCVYDL 12.000 105 14 10 EAPFSFRSL 12.000 94 15 138 YANVACAQL 12.000 106 16 195 CASGHVVTL 12.000 107 17 215 SSRIVGGNM 10.000 108 18 298 KPKRLGNDI 8.000 109 19 313 GPLTFNEMI 8.000 110 20 108 CVRVGGQNA 5.000 111 HLA B8  1 294 HSKYKPKRL 80.000 112  2 373 RDVYGGIIS 16.000 100  3 177 LPDDKVTAL 4.800 102  4 265 LPKSWTIQV 2.400 113  5 88 ELITRCDGV 2.400 114  6 298 KPKRLGNDI 2.000 109  7 81 RSSFKCIEL 2.000 89  8 375 VYGGIISPS 2.000 97  9 79 RCRSSFKCI 2.000 115 10 10 EAPFSFRSL 1.600 94 11 215 SSRIVGGNM 1.000 108 12 36 VAAQILSLL 0.800 104 13 255 TAAHCVYDL 0.800 116 14 381 SPSMLCAGY 0.800 98 15 195 CASGHVVTL 0.800 107 16 362 VPLISNKIC 0.800 99 17 138 YANVACAQL 0.800 106 18 207 ACGHRRGYS 0.400 117 19 154 SDNLRVSSL 0.400 118 20 47 EVFSQSSSL 0.400 103 HLA B2702  1 300 KRLGNDIAL 180.000 119  2 435 TRVTSFLDW 100.000 120  3 376 YGGIISPSM 100.000 121  4 410 RRLWKLVGA 60.000 122  5 210 HRRGYSSRI 60.000 123  6 227 SQWPWQASL 30.000 49  7 109 VRVGGQNAV 20.000 124  8 191 VREGCASGH 20.000 125  9 78 YRCRSSFKC 20.000 126 10 113 GQNAVLQVF 20.000 127 11 91 TRCDGVSDC 20.000 128 12 38 AQILSLLPF 20.000 129 13 211 RRGYSSRIV 18.000 130 14 216 SRIVGGNMS 10.000 131 15 118 LQVFTAASW 10.000 132 16 370 CNHRDVYGG 10.000 133 17 393 GVDSCQGDS 10.000 134 18 235 LQFQGYHLC 10.000 135 19 271 IQVGLVSLL 6.000 57 20 408 CQERRLWKL 6.000 95 HLA B4403  1 427 AEVNKPGVY 90.000 136  2 162 LEGQFREEF 40.000 137  3 9 VEAPFSFRS 24.000 138  4 318 NEMIQPVCL 12.000 139  5 256 AAHCVYDLY 9.000 140  6 98 DCKDGEDEY 9.000 67  7 46 FEVFSQSSS 8.000 141  8 38 AQILSLLPF 7.500 129  9 64 LAIGLGIHF 7.500 142 10 192 REGCASGHV 6.000 143 11 330 EENFPDGKV 6.000 144 12 182 VTALHHSVY 6.000 145 13 408 QERRLWKLV 6.000 146 14 206 TACGHRRGY 4.500 147 15 5 DPPAVEAPF 4.500 148 16 261 YDLYLPKSW 4.500 149 17 33 ADAVAAQIL 4.500 150 18 168 EEFVSIDHL 4.000 151 19 304 NDIALMKLA 3.750 152 20 104 DEYRCVRVG 3.600 153

The following references were cited herein.

-   1. Duffy, Clin. Exp. Metastasis, 10: 145-155, 1992. -   2. Monsky et al., Cancer Res., 53: 3159-3164, 1993. -   3. Powell et al., Cancer Res., 53: 417-422, 1993. -   4. Neurath. The Diversity of Proteolytic Enzymes. In: R. J. Beynon     and J. S. Bond (eds.), pp. 1-13, Proteolytic enzymes, Oxford: IRL     Press, 1989. -   5. Liotta et al., Cell, 64: 327-336, 1991. -   6. Tryggvason et al., Biochem. Biophys. Acta., 907: 191-217, 1987. -   7. McCormack et al., Urology, 45:729-744, 1995. -   8. Landis et al., CA Cancer J. Clin., 48: 6-29, 1998. -   9. Mahley, Science 240: 622-630, 1988. -   10. Van Driel et al., J. Biol. Chem. 262: 17443-17449, 1987. -   11. Freeman et al., Proc. Natl. Acad. Sci. USA 87: 8810-8814, 1990. -   12. Aruffo et al., Immunology Today 18(10): 498-504, 1997. -   13. Torres-Rosado et al., Proc. Natl. Acad. Sci. USA, 90: 7181-7185,     1993. -   14. Tanimoto et al., Tumor Biology, 20: 88-98, 1999. -   15. Maniatis, T., Fritsch, E. F. & Sambrook, J. Molecular     Cloning, p. 309-361 Cold Spring Harbor Laboratory, New York, 1982. -   16. Shigemasa et al., J. Soc. Gynecol. Invest., 4:95-102, 1997. -   17. Underwood et al., Cancer Res., 59:4435-4439, 1999. -   18. Leytus et al., Biochemistry, 27: 1067-1074, 1988. -   19. Paoloni-Giacobino et al., Genomics, 44: 309-320, 1997. -   20. Sudhof et al., Science, 228: 815-822, 1985. -   21. Daly et al., Proc. Natl. Acad. Sci. USA 92: 6334-6338, 1995. -   22. Rawlings and Barrett, Methods Enzymology 244: 19-61, 1994.

Any patents or publications mentioned in this specification are indicative of the levels of those skilled in the art to which the invention pertains. These patents and publications are herein incorporated by reference to the same extent as if each individual publication was specifically and individually indicated to be incorporated by reference.

One skilled in the art will readily appreciate that the present invention is well adapted to carry out the objects and obtain the ends and advantages mentioned, as well as those inherent therein. The present examples along with the methods, procedures, treatments, molecules, and specific compounds described herein are presently representative of preferred embodiments, are exemplary, and are not intended as limitations on the scope of the invention. Changes therein and other uses will occur to those skilled in the art which are encompassed within the spirit of the invention as defined by the scope of the claims. 

1. A DNA fragment encoding a Tumor Associated Differentially-Expressed Gene-12 (TADG-12) protein or splicing variant of TADG-12, said DNA comprising: (a) an isolated DNA fragment encoding a TADG-12 protein that has the amino acid sequence of SEQ ID NO: 2; (b) an isolated DNA fragment encoding a TADG-12 splicing variant that has an amino acid sequence comprising part of SEQ ID NO. 2; and (c) an isolated DNA fragment differing from the isolated DNA fragments of (a) and (b) above in codon sequence due to the degeneracy of the genetic code, and which encodes a TADG-12 protein or a splicing variant of TADG-12.
 2. The DNA fragment of claim 1, wherein the DNA fragment has a sequence shown in SEQ ID NO: 1, 3 or
 155. 3. The DNA fragment of claim 1, wherein the variant TADG-12 protein has an amino acid sequence shown in SEQ ID NO: 4 or
 154. 4. A vector comprising the DNA fragment of claim 1 and regulatory elements necessary for expressing said DNA in a cell.
 5. The vector of claim 4, wherein said DNA fragment encodes a TADG-12 protein having the amino acid sequence shown in SEQ ID NO: 2, 4 or
 154. 6. A host cell transfected with the vector of claim 4, said vector expressing a TADG-12 protein.
 7. The host cell of claim 6, wherein the cell is a bacterial cell, a mammalian cell, a plant cell, or an insect cell.
 8. An antisense oligonucleotide complementary to the DNA fragment encoding a splicing variant of TADG-12 protein of claim
 1. 9. An isolated and purified TADG-12 splicing variant protein encoded for by a DNA comprising: (a) isolated DNA encoding a TADG-12 splicing variant, said variant protein having an amino acid sequence comprising part of SEQ ID NO. 2; and (b) isolated DNA differing from the isolated DNA of (a) above in codon sequence due to the degeneracy of the genetic code, and which encodes a splicing variant of TADG-12.
 10. The isolated and purified TADG-12 splicing variant protein of claim 9, wherein the TADG-12 protein has an amino acid sequence shown in SEQ ID NO:
 154. 11. An antibody directed against the TADG-12 splicing variant protein of claim
 9. 12. A method for detecting malignant hyperplasia in a biological sample, comprising: (a) isolating mRNA from the sample; and (b) detecting in the sample mRNA encoding a TADG-12 splicing variant protein, wherein the presence of the mRNA in said sample indicates the presence of malignant hyperplasia, wherein the absence of the mRNA in the sample indicates the absence of malignant hyperplasia.
 13. The method of claim 12, wherein the detection method is PCR amplification, Northern blot, dot blot or a DNA array chip.
 14. The method of claim 13, wherein the PCR amplification uses primers having a nucleotide sequence shown in SEQ ID NOS: 28-31.
 15. The method of claim 12, wherein the TADG-12 splicing variant protein has an amino acid sequence shown in SEQ ID NO:
 154. 16. The method of claim 12, wherein said biological sample is blood, urine, saliva, tears, interstitial fluid, ascites fluid, tumor tissue biopsy, or circulating tumor cells.
 17. A method for detecting malignant hyperplasia in a biological sample, comprising: (a) isolating protein from the sample; and (b) detecting in said sample a TADG-12 splicing variant protein, wherein the presence of the TADG-12 splicing variant protein in said sample indicates the presence of malignant hyperplasia, wherein the absence of the TADG-12 protein variant in said sample indicates the absence of malignant hyperplasia.
 18. The method of claim 17, wherein the detection method is Western blot, dot blot, ELISA sandwich assay, radioimmunoassay or flow cytometry analysis.
 19. The method of claim 17, wherein the TADG-12 protein has an amino acid sequence shown in SEQ ID NO:
 154. 20. The method of claim 17, wherein the biological sample is blood, urine, saliva, tears, interstitial fluid, ascites fluid, tumor tissue biopsy, or circulating tumor cells.
 21. A method of targeted therapy to an individual, comprising: administering to an individual a compound having a targeting moiety and a therapeutic moiety, wherein the targeting moiety is specific for a TADG-12 splicing variant protein.
 22. The method of claim 21, wherein the targeting moiety is an antibody directed against a TADG-12 splicing variant protein or a ligand or ligand binding domain that binds a TADG-12 splicing variant protein.
 23. The method of claim 21, wherein the TADG-12 protein has an amino acid sequence shown in SEQ ID NO:
 154. 24. The method of claim 23, wherein the therapeutic moiety is a radioisotope, a toxin, a chemotherapeutic agent, an immune stimulant, or a cytotoxic agent.
 25. The method of claim 23, wherein the individual suffers from ovarian cancer, lung cancer, prostate cancer, or colon cancer.
 26. A method of vaccinating an individual against TADG-12, comprising: inoculating the individual with a TADG-12 splicing variant protein or fragment thereof, wherein the inoculation elicits an immune response in the individual, thereby vaccinating said individual against TADG-12.
 27. The method of claim 26, wherein the individual has a cancer, is suspected of having a cancer or is at risk of getting a cancer.
 28. The method of claim 26, wherein the TADG-12 splicing variant protein has an amino acid sequence shown in SEQ ID NO:
 154. 